IBM MASTOR System: Multilingual Automatic Speech-To-Speech Translator
Authors
Abstract
In this paper, we describe IBM MASTOR, a speech-to-speech (S2S) translation system that can translate spontaneous free-form speech in real time on both laptops and hand-held PDAs. Challenges include speech recognition and machine translation in adverse environments, a lack of training data and linguistic resources for under-studied languages, and the need to rapidly develop capabilities for new languages. Another challenge is designing algorithms and building models in a scalable manner so that they perform well even on memory- and CPU-constrained hand-held computers. We describe our approaches, experience, and success in building working free-form S2S systems that can handle two language pairs (including a low-resource language).
Similar resources
Recent Advances of IBM’s Handheld
Recent advances in the processing capabilities of Personal Digital Assistants (PDAs) have enabled the implementation of an end-to-end speech translation system on these devices. We have presented a bi-directional speech-to-speech (English and Chinese) translation system, which is hosted on a PDA running embedded Linux. Our Multilingual Automatic Speech-to-Speech Translator (MASTOR) system includ...
Recent advances of IBM's handheld speech translation system
Recent advances in the processing capabilities of Personal Digital Assistants (PDAs) have enabled the implementation of an end-to-end speech translation system on these devices. We have presented a bidirectional speech-to-speech (English and Chinese) translation system, which is hosted on a PDA running embedded Linux. Our Multilingual Automatic Speech-to-Speech Translator (MASTOR) system include...
A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation
Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making man-machine interaction more natural. Since speech is the most popular method of communication, recognizing human emotions from the speech signal has become a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...
Multilingual Generation for Translation in Speech-to-Speech Dialogues and its Realization in Verbmobil
This paper presents the generation module of the speech-to-speech dialogue translation system Verbmobil. Spontaneous speech, large multilingual vocabulary, difficulty of the translation task, robustness and real-time constraints make the design of such a module very challenging. In order to overcome these difficulties, we have developed a system based on a general kernel and the declarativity o...
Dealing With Multilinguality In A Spoken Language Query Translator
Robustness is an important issue for multilingual speech interfaces for spoken language translation systems. We have studied three aspects of robustness in such a system: accent differences, mixed language input, and the use of common feature sets for HMM-based speech recognizers for English and Cantonese. The results of our preliminary experiments show that accent differences cause recognizer ...